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Different v^iewpoints regarding educatiOMl testing 
a^a described. Bhile soie people advocate continuing r.el*ah^e upbA^ 
standardized tests/ others favor the discoiftinuation of such * / 
achi^veient and .intelligence tfests. Ths author recoiiehds a ^^d^rate 
vi^v soaevhere between these tva extfeaes. Probl^is associated yith 
standakrdizeti testing iff the schools incrlude the tendency to evaluate 
educational prpgraaa accordllng to their effect upon •test scores^ the 
disccfntinuaction worlthwhile programs that do not contribute to 
iaprofed test scores^ and the amount of ti»e required to adulnister 
test , batteries. >rotleas associated with inxjorrect interpretation of 
itttelligettce. tes :sr and contents on Jensen's article regarding the 
relative iipact >f geh^tic and environaental factors on intelligence 
are also discuss sd. The growth of the testing industry is described^ 
and |lternatives to^resent lethods of testing are suggested. The 
'•author recowends. that those gi^oups aost. i^nvolved in the testing 
process^should begin^ iir a leaningful way/ to consider ^the .uses and 
Abuses of testing. (GDC) ' ' * '[I 
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In November 1972, educators from several pdrts of the Uni- 
ted States met art the University pf North oWkota to discuss ^ 
some cc^unon concerns about the oatrow accountability ethos 
that hajd begun to dominate schools and to sKare what many 
believed to-be more sensi]til% me^uns of both opcumenting and 
assessing children's learning. Subsequent m^etingj, much 
shariftg of evaluation information, and financial and moral 
support from the Rockefeller Brothers Fund h^v^ all con- 
tributed to keeping together what is n6w callipd the North 
Dakota Study Group on Evaluation. A major goal of the 
Study Group, beyon4 support for individual participants, 
and programs, is to provide materials for teachers, par- ♦ 
ents, 'school administrators and governmental decision^ 
makers (within State Education Agencies and the U.S. *^Office^ 
*of Education) that mig^it encourage re-examination of a ^ 
range of evaluation issues and perspectives, about schools 
and srchooling. 

Towards this end, the Study Group has initiated^ a 
continuing series of monographs, of which this paper is . 
one. Over time, the series will include material on^ 
among other things, children's thinking, children' s lang- ^ 
uage, teacher support systems, inservice training, the 
schooTs relatipnship to the larger community.. * The intent 
is that these papers be takep not as final statements--a 
new ideology, but as working papers, written by people 
who are acting 5n, not just thinking about, these problems, 
whose implications need an active and considered response. 

^ ' * I^ttc? FevTone^ Dean 

Center for Teaching § Learning, 
*^ diversity of North Dakota 
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Introduction 



John D. Williamsx, Profes-' , 
sor, Center for TeacHing 
aAl Learning, The Univer- 
sity of North Dakt)ta. His 
teaching are^ is applied 
statistics. His articles' 
have appeared in Multivari-' 
'ate Behavioral F^H^Bearoh, 
The Jom*rial' of Experimental 
Education, The Journal' of 
EduoatibruH Research, Edu^ 
aationat and Psychological 
M^urement, The^ Journal 
of Genetic Psychology, The 
Journal of Psychology, 
Peyahological Reports, Be^ 
havioral Science, Percept 
tual and Motor Skills Re- 
deca^h in Higher' Education, 
Multiple Linear Regression 
viewpoints, a.rd several 
otfier professional jour- 
wis.' 



The debate regarding standardized testing, in geners^, and 
intelligence -testing in particular, has quietly 'risen to 
a conunanding posit ion, con the agenda of alL those parties 
trying to influence the policy of American ^education. In 
the process, the deb^e has trimmed out and clearly marked 
its boundaries . w 

One edge p£ tjie debate Is cejitered on the age-old 
heredity-environment controversy ; *this aspect of the de- 
bate was renewed by Jensen (1969), with Shockley (1971) 
and Herrnstein (1971) in a supporting' cast role. Their 
.(Collective view on the issue of heredity and , environment 
as^ it affects, intelligence might be characterized by say- 
ing heredity predominates and accounts for perhaps 80 per- 
cent of human variability in intelligence te^t scores."" 
Their protagonists (Sitgreaves, 1961, Kagan,/1969, Lewon- ' 
tin, 197^, Fehr, 1969 and Morris, 1972, te name but a few) 
a^e not sp easily classified as to the' causative agents of 
human intelligence, but they do agree that it is not due ^ 
to heredity, a loae . ' ' 

Another edge pf the debate centers- upon test validi- 
ty, either as it Applies to particular items or as it ap- 
plies to .testing subgroups not originally measured in' the 
norming of tfl^^^t. A most provocative explication of 
the, inappropriatene^s and inaccuracy of some test items 
that appear on widely available tests was m^de by Hoffman 
(1962). More, recently , the March/April 1975 issue of 
Principal was devoted to "The Myth of Measurability" and 
included ^rticjes criticising the it^ms.that make up sev- 
eral tests , .including the Large -Thomdike Intelligence 
Tests, t+ie Ueohsler. Intelligence Scale for Children,, the ^ 
Otis-Denon MeAtal Ability Test, and The Iowa Teste '6f 
Basic Skills, ^o name byt a few. A subsequent issue 
(Jul)6/August) was devoted to *'The Scoring of Children: 
Sftindardi zed Testing in America." This second issue was 
*even mqre critical 'of the testing industry, calling into 
question almost every one of its practi^ces. 'The opposite 
view is expres^sed by the test-constructing industry and 
Its defenders . ^ ' - 

Another edge of the debate centers upon test usage: 
How are the schools usin'g (or going to ^e) th^ datta a- 
vailabi^ on each child? If the school is to develop 
learniii^ settings for "non-«rfiatinstream" qhildren (classes 
for the learning disabled of classes for the gifted) , how 
impoi*tant should be the rale playeci by standardized test- 
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ing? One point 'of view is that ^if the labelling process 
■> overrepresents or underrepresents an identifiable sub- 
group (females, blacks, Spanish-surnamed^ lower socip-t 
etonomic statuj, left-handed persons or Catholics) , Itften 

** the test is discriminatory and should be abandoned, Pa- 
dilla and Garza (197?) point out that Spani sh-sumamed 
children'have a 2-to-2 1/2 times greater chance of being ^ 
L involved in^classes for slojv learners than,QtF)ers in 

* ^<Ce»as and California. They contend that "IQ*'* tests- -are 
* an inadequate measure for the Spanish-surnamed bt^alise 
they do not sample the cultural and linguistic experi- 
ences likely to be. a p3tt of the child'§ ba:ckground, 
They*«^3l$o are among several professional educators call- 
ing for a moratorium on intelligence testing ofMninority 
children. Far ^rom trying to di^fend itself, or even^ 
turning a deaf ear to the criticisn^ the test ind^st^ 
has seen this^ as an opportunity to sell more and newer 
tests. la fact, '4 plethora of nests have* been recently 
rewritten for specific minorities; 'Samuda. (1975) has -^an 
e^xtensive annotated bibliography of tests for minority 
students. , * 

UNDERLYING PREMISES ^ 

Perhaps much of the debkte alluded to in the pre- 
ceding section IS due to very di f ferent - underlying pre- 
I mises of the various par|^c*i pants , To take two "straw 
men" at each node of the j:cntinuutn, a staunch defender 
of standardi'^^d testing might reason that the 'tests, for 
^the most part, hay^been rigorously standardized in field 
testing and that a viable product has been ^hieved; the 
corresponding *'ant i -tester at^ the other enCN^^ the con- 
tinuum, might reason that tests 'f^at are made uja, of such 
fallible Items as those that occur on typical tests can 
mean no more in their' suip than the contribution of. each 
item; if the items are as questionable as they se^m ^o 
be, then the scores are most likely meaningless. Fur\ 
th^er, if the tesfs empirically show themselves to be used 
to place people ^into slower learning situations and an 
identifiable subgroup has mo^e than its '*share" of people 
so identified, this is pr'ima facie evidence that the test 
is discriminatory toward the affected ^roup. It is 
» clear, then, that peop-le with such discrepant .belief,, sys- 
tems w\ll likely look at.the^s'ame set of data and draw 
entirely different conclusions. . ' - 

The picmises of the present paper are few. First,' 
, in rjEjgar.d'to thfe teaching- and learning of students, it 
is felt th^t students should be irt learnifi^ situations 
that allow them' to maximize their learning potential 
while minimizing the interff rencef of , non-learning situa- 
tions. No specific learning environment is prescribed; 
indeed, if Hunt (1971) is right, stuTCnts shoul4 be 
matched with learning environments that best fit their 
conceptual functioning;, what might be a maximal ly" func- 
tioning environment at one stage of the student's le'^r 



*^ne recent author who con- 
tinues to use this concept 
is McCail (1975) In an 
attempt ^o explain the in- 
telligence and hcreditv 
concept at a fairly ele- 
mentary 1 eve 1 , McCa 1 1. opt - 
ed for a simplistic defini- 
tion of an intelligence 
test' score. Perhaps, part , 
of the rationale for thi^ 
is. that ^the tprm "IQ" i- 
ubiquitous in the jargon 
of nOn-psychomct r 1 s Bji 
faplta^l 1 ng on the famJl- 
Uacity o.f this term, Mc- * 

ill may have felt that 
the reader's ^n tercet aii^ht 
'more l^ikcly he i?ept i v at c-d . 
and, 'in the process, maKo 
him more fuflv informed on 
th6 intel I igencc-hcrcdi t> 
.issue. Unf ortunate 1> , the 
perceptive reader may set* 
the "flaw';' in the defini- 
tion of intelligence, and 
"thus reject the point of 
McCal r wri ti ng. * 



ing may be inappropriate at a lat^V stage. 

A second premise is that the -conventional wisdom of ^ 
any group is always open to question. As the 'convention- 
al wisdom of any group has a way of: being nyodified over 
time Cperhaps because not all members of a group accept 
the conventional wisdom)^ .typifying ^the <ite^ter" oi^ the 
'•anti-tester" is always' time«»dimensiDned and hen(^ inac- 
curate shortly 'after attempting to typify, or "stereo- 
type," that conventional wisdom. For example, thpse in- 
volved in 4ntelligence tjfsting sufely, n^onger include ^ 
in their thinking such-j/romides as (a) intelligence is. 
fixed and unchanging; [bj without considerable intelli- 
gence (say"v intelligence scores of 130 and above as mea- 
surejd by Wechsler^s Adult Intelligence Saale}'^, such at- 
tainments^ as graduate degrees are impossible (or/ at least 
highly imMrob'able) ; or (c) 'crime and lower intelligence 
are inseparable. * '/ 

J Interestingly, t\e critics of the inteyigenqe 
tf sting movement are guilty of at least ©ne jU}ot faux- 
pas whqn they continually refer to "IQ" tes/ing. The 
IQ, or "Intelligence Quotient," has passed/from the lexi- 
con of most psychometricians ^ho engage^ iFl intelligence " 
testing. There' were too many inconsistenciesv in attempt- 
ing to arrive at a measure of 'intal ligence\by the fabled 
formula, IQ = ^ (100) where: 



,IQ = LjjNL intelliisgericfe quotient, 
MA = the mental age, and 
C-\ = irln-ono logical age.* 



One problem is that the gradients of learning are not 
<;mooth enough to allow the'^IQ scores to have a sufficient, 
degree of predictability in longitudinal studie's^ 'T^e 
constructors of'thc otanford-Binet test coined the' term 
"Deviation Quotient',' and developed norms at each age as 
an alternative to the difficulties of the traditional IQ. 
Other intelligence test constructors (and users) have' 
opted for the term "intplli^nce score" orj even "academic 
apt itudc tcit."* ' • 

The point' that test constructors and allied psy-. 
chometric personnel are not particularly impressed by 
condemnations of a term (iQ) that has been out of general 
usage for more than a decade. To be fair, -the term'IQ^ 
conttnues in the vocabulary of Jensen, Shockley and 
Merimtein. Perhaps the psychometric community is being 
inadequately represented in this debate? 

The conventional wisdom of the ^^'ant i- testers" is 
not nearly ^o closely delineated ^is it ifS for the "test- ' 
crs"; those opposed ^to stafHiardi zed testing fiave almost 
^as many positrons on, testing as there arc people in oppo- 
s4tion to the testing movement. For example, one might 
say that the present conventio^ial wisdom asks that a mor- 
atorium be made on intelligence testing in f^ublic 
schools. Others might hold that this 'morato/ium b^ made 
on utilizing data from some vej-y specific subpopulation. 
"In between these two points occur many clearly staked-out 
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positionk. • . * . , ° 

A third premise holds that afteimative explanations 

' and/or ^research methodologies may show a very different 
light on a given topic. For ex^Jnple, in ?:egard to the ' 
heredity-environment issue, it seems logical to lo^k .out- 
side the usual educationist attempts at research ahd^at'' 
least become aware o^ny research relating to* intellec- 
tual functioning 'fro/ a genetipist's point of vifew. 

It should be (Vlear .from' these premises^ then,' that 
the point of view being developed is a dynamic Viewpoiiit;,^ 

.as more evidence accumulktes , from whatever its source, 
some rethinking i,s necessary. So is it nece$s£uy that no 
evidence be discounted because its source fails tto pro- | 

rdtide' the proper crecjentials , either educationally or phi- 

\osdphicatly. Whether it is presented by a black>?oman 
with\Qnly an eighth grade education or by a person such 
as Shpckley, who is neither a psycholpgist nor an educa- 
tionist? the^eyidence is tiot to be rejected with ad homi^ 
nem argument's . • ' ' 

Such an ad hominem ^argument apjiears to have been 
made in an -otherwi^ reasoned- def^iftse^of the testing 

'movement by Ebei (1975,*^. 83): 

» « " ' 

Edycation is blessed with a great many capable and' 
ded&cated teachers *and administrators. But »the- 
profession also has its share* of mediocrity and of 
faT-^e mes3iahs. It is from the latter ^oup that 
the loudest protests are,_heard against tests and 
\ t e sti ng . ' ' 

fo be placed in the caap of "mediocrity and of falsie mes- 
s^ahs" simpiy because one hoMs a view different from the 
"eVpert" is not v.ery comforting. \ Green (197S), I thjLrSc^' 
di^well to point out that the majority of t^ose who' 
oppise testing do sp from the point of view that , minority 
group interests are often violated* by abuses of the tests. 



Jensenism and Alfti-JenserHsm ' y 




Perhaps no scholarly; publicat^pn has atti^acted more at-' 
tention. 'than Jensen *s lengthy article in the Harimrd 
Educational RevieiA' The reaction against him has been 
so strong in some quarters that it' seems compelling 
find .out its cai^e. At the American Edycational Res 
, Association Convention in Chicago, in Af)ril 1972/ where 
'Jensen -^was scheduled to adjjre^ a reseaith audiehce re 
gardin^ his findingSs^ Chicago city teadWrs not- only 
picketed the convention, urgi]^g researchers not to at- 
tend Kis presentation, but several stormed the Wlll, arid 
among other indignities 'shown to Dr, Jensen, his notes* 
were stolen. Moreover, demonstrators causjed so much dis- 
ruption that his presentation' was cancelled, por such 
an unusual amoqnt of attention to be foiste^ upon a re- 
searcher^ (albeit negative %f tfentiort) , it behoo\es other 
researchers to at least find out about the squabble and 
the research that generated it. What, in f^ct^^has been 
the case? ^ 

' ' I,n the past decade, perhaps a handfuT^of education- 
al research studies have captured the public's or rath- 
er the media's, attention\ Besides Jensen's Work, -Cole- 
man et aK *& (1966f» monumental ar\al/sis iDf daja conduct^ 
ed at. the request ^of the then President Johnspii is no- 
■table; Ros'enthal and Jacobson's (1968) study f^cu^ed at-^ 
tentiqn on*teacher** attitudes toward students and 
helped to popularize the term "self-fulfilling prophecy". 
Jericks's (1^72) reanalysis, of Coleman's data*^s an at* 
tempt to ass%ss, to some degree, Jensen's fiTtdii\gs, on ' 
the importance of heredity'. In each cfase, very few peo- 
ple, including researchers, appear to have reached back 
to the original source materials for their inform^fction, 
but instead have* relied on the media or cpndensatiqns 
and/or rebuttals by others. Thi#. is understandable to 
an extent, as each of these research ^efforts is lengthy 
and would require detailed examinatipn. This is parti- 
cularly true of Jencks's study; if one were not a statis 
tician, understanding Jencks >s^results would almost seeisf 
to be out of the question.^ But of those profHn^nals 
wjio have responded in print .regarding Jensen's ^^Pt,>the 
criticisms are ijdt nearjy as strong as the media might 
have led us to beli,eve. For example, take the reaction 
of a noted "anti-tes<:er Banesh Hoffman, In a recent^ 
interview: v ' ^ 

. / ' . 
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• Whei) Jensen wrote his paper, I read about* it in 
\ *the New Yo.rk Times and 1 was Jjighly incensed.- I 
wa"s ready to go out on the h«sting« and write a 
letter to the* Times ,^ and all that. TJien I thought,'^ 

.before 1 do that, I better Vead'his ^pajftr. I did 
/ so and'was surprised to find that it was feally a ^ 
serious, honest ^paper. I tbink, however, tF^at Jen- 
sen has not realized one important thing that is- 
vei^ h^rd to measure, and that 'is the effect en- 
vironment has on a person. For instance, if you 

•are a^black child, you can Sense the hatred that 
is focused on you; y<^i realize fhat if you do any^. 
thing good^ no one. is &oi,ng like it comine from 
a black kid. *So you have children growing ^ in 
an atmosphere of oppression and hatred* I aon*t 
thifik that Jensen x-ealized how terrible^ that is, 
how it can stunt the intellectual and ^mbtaona'i " 
growth of a person. (Ban^h Hoffman, interviewed. * 
by. Houts, 1975, p. 36.) ' 

' ^ ^ t • /-^ - * ^ 

A review of Jensen's article is in o^der. 

- r . ■ ■ - 

A REVIEW, OF JENSEN'S ARTICLE' 

The ^Vt^icie begins, ironic'ally, with the ^oi^t on 
-vihich his evidence is weakest: compensatory education 
has fa^iled. He does point out that some of the ea^Ix^ ef-. 
forj:s at compensatory education •(Pi;oject Headstaft and 
related activities) have'oft.en had less, than glowing suc- 
cesses. But he does not consider the perspective of those 
in the community served ,by th'e^ various programs*. To* a * 
bia*ck, Chicano, or Indian, the' compensatory education pro- 
grams of the 1960^mua.t have seemed something leis than 
a panacea for all^heir wreeds. (In fa^ct, that black ai* 
Chicano or Indian might have remarked, '^Isn't it funny 
tha't a^l. the massive amounts of money available always 
go into white hands? The a'dministrators of the program 
are white; the teache}s ar^*whit-e; and all the materiafs ^. 
bought are boughjt from white businessmen. * Don't they 
trust usr*) * ' . ' * ' 

Next, Jensen ventures into a standard texti)ook' ex* 
planation of t4ie natire of intelligence (24 pages). Ip 
it, he considers Several conceptions of intellig;ence, but 
opts for a pragmatic solution: measiUre« certain kinds of 
behavior, look at their relationship to other phenome*na 
and see if ^he relationships make any sense, "while $ome 
mi^t have grave misgivings aboi*t not pinning down, the 
entity (intelligence), Jensen's approach is^ fairly stan- 
dard in psychological researchr., "Jensen oupt^es several 
studies regarding the correlation^ of in^Iligence mea- 
sures and indices of socio-economic status (thVjr's range 
from .42 to .71) . ' * ' ^ 

Jens6n alsd enters a dis.cussion of genotypfe (the * • 
genetic make-up) and ph'onotype (an observ^blQ or measJr- . 
able chaif-acte^f istic of an organism). The^square of the 
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correlation betwecfr the genotype and tHe phenotype* is 
called t^ie ht^re^itability of that. t)^iti»^e discusses 
the distribution bf inrel 1 i gei^ce^ S^cotes^ \je fails, how- 
ever, to go ^ntC.the conrstwJction 'o^^ \«teUigencre tests, 
a shortcomingi 'I find singularly , perplexing*.* 

' Then Jcnseh consxd^rt' the inheri-tar^Cle bf interli- ^, 
gence from, a genetic vipwpdint» , Firsi , ;l)e\fe views the ^ 
work of Burt. (1958; h9(T6y, whicfi was ari'at^tempt to sort 
out the proportipn of ,the,^ari^ncre tiri. im}^*lligence t^sts 
re 1 at ingi? to 'genetic and envir^ifSteTitialVWctor^ . ^* S^i^opd, 



*Whife the ^construption of* intelligence tests- is' neces-' ~ 
sari ly heavily statistical by nature, much 'of th^ /Con- 
struction process uses methods ordinarily, tajjght i'Ti a 
first course. -Wechsler*s is quite illuminating ijJ this 
regard both because the test highly^ respec\ed*, and 
because the, manual^does an excelleat job of describing 
thd normmg process. * . i « 

Wechslef has two scale^ for chi Idren, Wechsler^^s 
Intelligence Scale for (Children .(WISC) , ages 5-15, ?nd 
the- P-ze^sZer* Preschoot and Primary Scale of Intelligencf 
(WPPSIj, *ge^ 4'-6 1/2, a rid ^ a scale for ajiults, Wechsler^s 
Adult iKtelligence Scale:(miSy. Th^ ^JSCMocljides six 
performance tasks atid six ^erbal tasks^ The W^IS. omits 
one performance task and ij, of cour^' a|> a more diffi- 
cult level, fl'he ope omitted task is^e mazes tasjjf* 
The reason for its om»»4on says quite a bit about in- * 
telligcnce test construction;^ among qhi Idren, . there ar^ 
no Significant s^ex differences on» any of the 12 tasks: 
For adults, men score s I'^nificant ly ^higher on the mzv^ 
task; becaus> an assumption is made that there is no sex 
.difference on any of thre 12 tasks, *those tasks that do 
shov^f a differ^ence are discarded. (One mi^ht ask, as , has* 
been done*by Bane (197,4), why don*t we get rid of items'^ 
that show race discrimination, thus render4ng as aca- 
demic the whole issue brought up by Jensen?) ' 

Each task tha^i*' comj^lt^ted' on the/Wechsler tests 
yields a scaled score. (not necessarily a- pofjt for each 
correct response)^;. 'tke scaled scpres a.re then added,, a^d 
their sum is^ transformed to a new veiled score, the in^ 
tel 1 igenoe -score. J^^igkially, WechsJer standardised his 
data so that the^TiAn Would be 100, thfe 'Standafd* devia- i 
tion would be'l.S,'rMid there^ would be no'sex differences. 
Further! the data were scaled so that normal distril^u- 
tion would result. Nevertheless, some interesting ar^om- 
alies occur in parts of the test. For example, no more , 
credit IS given for getting all the/items correct than,- 
ifi the longest. digit span is mtssed". 

** Burt's data appear. to be the ^eeds of Jensen's con- 
cept that intelligence is 80 pertent determined by genet- 
ic factoirs.; l^'hile it is difficult to criticize an au- 
thor on a second-h*and basis-, ijt appears Burt i4Sj?(| an hi- 
erarchical modol (see Cohen, 1968) that "^insurd^ the pre- 

■ . . , 



^ ^ -^^ he reviews .thdi^va'Tious. kinsVip stud|^i^B, which, in turn,''* 

^ ' , ' * ' . were «arLier reviewed by Jtf lo^nmeyer-Kimling an^ Jarvil^ 

♦ • ' , (-1963). , He also reviews ^e twins studies, ^fr 6m which 

* * • he draws several points" firsts that correlations in., 

, intelligence scores -of identical twins are'systematicaTly 

' higher than for ftat-etnal twins; second, ...that identical 
'V- ' ' . %win^'yho have been separated tend to fiave* more similar 

/ ' , . 'test scdres'than fraternal ''twin^ raised Together.* 

, '^^r * Finally, Jensen *^revi^s studies regarding environ-* 

l^Tsln'^sfof thSe' /"'^""i corrplates of' i^tel ligence . getting td the coie 
stufties. • • ' , issue only on page 78 of a 123-page articl^ In fact, 
* ' ' ' ' * had the first two pag-es of the article been omit{eti, and 

. . ' ' had the article ended on page 77, *Jensen Kould proba'bly 

stirl be an obscure educational psycho Togist . But. after 
* ^ • page 77, Jensen gingerly rationalizes' why it is at least 
* , . • interesting'to investigate ract»l differences in inteir 

r - ligence, as racial differences in many othfer aspects of 

'5 . - , . ^ ^ humari^ liTe have bpen investigated. As Jensen passed 

^ * . point ^^f no return (discussing racial df f^erenres) , a 

_ • * grossly oversimplif ied**synQp§is of his reasoning might 

' . , . ; be useful,. , First, he attempts to show that i'ntelligenc^ 

is 'to some degree (he says' 80 percent) hereditary; sec- 
ond,^ he attempts to show that blacks score an averagers 
' pointil below whites on intelligent^ t«sts; third, Jie at- 
, tempts to show thaj^ompensatory ".education has failed, 
V-* His evidence for the firsr point is rather substantial. 

. , . ' , His' evidence on the se^pnd point' is somewhat weaker, bu^ 

« - given the misg;Lvings, at, least -the -relation of- the evi- 

, ^ ' dence is unable, The**tnifd,-p'oint has little documenta- 

• , -tion in the article; -it ^eems to be the crux of his pre- 

sentation, however,, , ' - ' 

A little^ discussed point fi?bm »this part -of Jensen's 
article rela'tes to the American Indian. The American 
..^ - ' - Indian is s^id to be , * by far the most enviror>- 

mentally disadvantaged group' . , . yet th(Bir scores^'on 
intelligence tests put them only seven to eight points 
^ \ below the Caucasian mean. ^ Indeed, on at least^ one . test 

(Denhis,* 1942), American Indians haye\been shown scort 
higher fhait'Caucasians , ^ In fatt, -a group of Hopi had a 
* Nnean intelligence score of 124, ^clearly' superior to most 

Caucasian samp-les. Jensen argues Jthat *if environment is. 
causal, to the blaok-Caucaslan differences in intelli- ' 
' , ' ' . gence, then>^iow^an environment logically be related to 
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dominance of genetic factors.' Had environment been usfed 
a? the 'f irst-ord^ed va^i^ble, ar very different estijnate 
, of •the importance of h^ed^ty would probably have been ^ 
found, Kamin (1974) thoroughly' attacks the Burt studies 
on the grouiidS that their lack of adeqjiate r^porJting and* 
, apparent .assessment process of adult intelligence on'a 
^ norl-testingvbase reduces them to an interesting commen- 
#^ary on psycho Iffgical re^eafch in the middle third of 
thp twentieth century, clearly lacKing in terms of to- 
' day's s'tancjards. * 
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.the Indian differences? , * • 1, 

Jens'ep tliQn Considers .a eugenics argument that does 
have a database. Apparently, uppetv!and middle qlass 
blacks have ^ substantially lower birth* rate than their » 
Caucasian counterparts. The reverse is*trup oi lower 

^' class blacks and 'Caucasians . To whatever degree social 
class is corriBflated to' intelligence, and t1ta.t relation- 
sKip*, in. turn, is* correlated* to heredity, Jensen conjec- 
tures that future studie^^ould shoW larger discrepancies 
between the two races oi\,^easures of intelligence. • 
t The' remainder of sJeasen's work is concerned with 

. the Various studies of compensatory educatipn: The imptfr- 
' tanf point'^n the later pprtion of his article i^s' that, 
in separating intelligence into two levels (LeveJ One,.. 
Associative Learning, and .Level Tw6, Conceptual Learn- 
irife) , " schools "tend to emphasixe conceptu&l learning, 

» jtfhile tt!os© at the lower ability levels achieve a greater^a^. 
diegree of succes's with associative learning. Jensen 
would encourage a much gxreater emphasis on associative 

• learning for lower ability students'. ' 

REVIEWS OF JENSEN'S ARTICLE \% Sl^SEQUENT ISSUES OF T 

HARVARD EDUCATIONAL REVIEW \ ^ 

In the^ following two issues, the Harvard ^Educa:^ 
tional Review^ ^\xb\\s\\ed several critj^ques of Jensen's 

^ paper, written before and a'fter the paper had become a 
ral lying flag. Kagan (1969) suggested that Jensen's ar- 
guments were not compelling, but only suggestive. Clear- 
ly his cpnclusaons reg^ding corpj^ensatory educati|pn, 
Kstgajiv wrote-, were inappropriate, on several grounds; it is 
illogi'cal to use current evaluations to dismiss all pos-^^ 
•sible compensatory programs. J. McV. Hunt's (1969) crit- 
icisms were astonishingly milti; his greatest. criticism 
focused on Jensen's having staged "compensatory education : 
has been tried a^d it appears -tp have failed." Hunt saw-^ 

' •that as a half-truth placed at the beginning' of the papei^* 
for it& dtamatic effect. . Crow h^69) , a geneticist, 
agreed 'for the most part with J^sen's analysis, but ^ , 
pointed out Ihe 1 imitations* of the mathematical assump- 
tions, the sample s^ze, and lack of evidence ' regarding " * 
changes in the;environment that had not yet been tried: 
/ * Berei;ter J1969, p, 310) stated, "My own view of the 
future of individual differences and their social conse- 
(^uer^es is -even le'ss optimistic than Dr. ,Jensen*s. The - 
hereditability of intelligence is unquestionably high 
. . . with further social progress, its reliability can - 
only increase . . * CpnceplfMal ly compl^ex machines such 
as computers that are understandable to only a small per- , 
c^nt of the populace will inevitably magnify individual 
differences in ability, Bereiter wrote. Elkind (1969) 
described a Piagetian ''conce|#ion of intelligence; he also * 
related that his evide»ice injdic^es that .the longer 

we delay formal instruction, up to certain limits, the ^ ' , 
greater the period of plasticity and the higher the ulti- 
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mate level of achievement." Thus, nursery schools (and. 
the compensatory '^ograms for pre-school children) mi^ht 
b$tt*r tr>^to create experiences that have immediate ^ 
vaTu^ for the clvild* ! - 

• Cronbach (1969) agreed to some extent with Jensen's 
findings, but found the Levels I and II di^tinctiqn in 
intellectual fuijcffoning to be an oversimplification. 
Br^zziel (1&69) tfook .umbrage with J^nseti regarding the* 
application of -similar restCarch to the blacks in.^e 
^outh. His concerB was that racists Will use the article 
n an attempt to continue or reestablish as much segrega- 
tion ^s they caa legally achieve,. 

Clearly, the earlier criticisms, written pVior to' 
the publication of Jensen's article, were not nearly so 
marked as they were reported in the public press. ^ The 
re^views that appeared in the second post-Jensen paper is- 
sue of the hevieWj when the media fiad taken it up, were 
.somewhat strongi&r. 

* What .might be characterized as one of the better 

critiques of Jensen's statis/tical methodologies was jnade 

'by Light and Smith (1969). ^They showed th^ almost 9 
points of the 15-point difference in mean iiff^lligence 
scores between Caucasians ^nd Negroes can be attributed 
to the disproportionate distribution of *socip-«conomic 
status as it relates to "race." They further argued 
that, if an unusual interaction ga^ttern is present, the 
entire 15-point difference can be taken in^o account* 
Unfortunately, they presented no empirical evidence for 
the unusual interaction pattern. If they ha5, its social 
consequences would seem to be unacceptable; in some in- 
telligence levels it would be more beneficial to have a 
lower standard of living,' rather than a higher level. 
Few families would seem to want to mowe into abject pov- 
erty for an average "payoff" of perhaps 4-6 imtelligenc^e 
score' points . 1% the end, one might inter^retlight atd^ 
Smith's article as presenting evidence that aJlwted, 
that whatever the difference is" in intelligence scores 
between Caucasians and Negroes, it is surely considerably 
less than IS points. 

Stinchcombe (1969) made several "points about the 
Effect of environment, perhaps the most valuable point 
being in reference to Head Start. He wirote that insofar 
as deficits are cumulative, having an enricheS experi- 
ence for only on£ or two years is not nearly the same as 
living in an enrick^d environment for all of a person's 
first 20 years. The research on the estimates of herit- 
ability in the various studie^of twins was reviewed by 
Fehr (1969). Using a different analysis, ^ lower esti-** 
lilate of the heritability coefficient was found. > 

Perhaps pne of 'the most cogent discu^ions o^f the 
lira^itations of Jensen's study was made by Deutscl>' (1969) . 
As Deutsch,and Jens ec formerly collaborated in earlier 
research, Deutsch might be seen as haying written a let- 

-ter to a former colleague askirig him to ". . . retract 
his genetic conclusions in the lightof^data about and 
understanding of , environmental, factors with which he 

10 . , 



Was apparently not familiar at the^time he wrote his ar- . 
"ttcle" (p. 5521. Oeutsch also pointed out that lawyers 
in some desegregation cases and some legislators have 
used Jensen's article either to ^oid full integration or 
to underfund public education. Ins|K6 time since Deutsch 
made his plea, Jensen Has ,not only retracted his ar- 
gument i>ut, if anything^ iis makij^g stateiftfents 

t|in regard to the importan ce of h eredity. , ^ 
I Several more causVic publications have answfeffed in 

criticism of Jensen's; article. Richar4son and Spears 
(1972) edited a book of essays refutiAg Jensen, including 
-a particularly scathing^ess^y by Swift, who views Jensen 
as a heretic from the^ scientific community. Gartner, ^ 
Greer, and Riessman ^19^4) also edited a book of essays, 
, called The New Assault on Equality^- ythich a neutral ob- 
server might view as leaning more toward a polemic than ^ 
shedding any new understandtHg of the cont-royersy . 

• In 1974, broa^dening^ the controversy, Chomsky took 
Herrnstein to task, stirring up hi»s ov<ji hornet's nest., 
^homsky'^s position might ,be interpreted as requiring the 
suppression an^'abandpnraent^ of research if it led to^- 
findings unpleasant to the Egalitarian pojint of view: 

Turning. to the question of race and intelligence, 
we grant too much to jthe co«temporary investigator 
of this question when we see him faced witl\ a 
conflict of values: scientific curiosity versus . 
social, consequences^ Givei^ift^e virtual certainty 
that even the/undertaking of the inquiry will re- 
inforce some of the most desjiicable features of 
-our society, the intensity of^the ptesiomed moral 
dilemma depends critically on the scientific signi- . 
fi$ancq|-of the issue that .he is cljoos^ni to inves- . 
tigate. Even if the ^cienti'fic significance- were . 
immense, we should certainly question the serious- 
ness of the di.lemma, 'given the likely social con- 
sequences'. But if the scientific interesf 'of any 
' possible finding is slight, %ren the dilejmna ^van- 

ishes. ^ • 

In fact, it;seems that tlie questiph'of the re- 
lation, if .any, between race, ancl intelligence has 
little scientific importance (as it has no social ^ 
importance, except under the assumptions of a 
racist society). A* j)0ssibl€f correlation .between 
mean IQ and skin color i^ o^ not greater scientific 
interest than a correlation between any two. other 
arbitr^rily^eleCted traits, say, mean height arfd 
^ . color of eyeS. (p. 99) 

Banesh Hoffman, in' his ' interview b/Houts in 1975, 
had some Interesting views regarding the'question brought 
up by Chomsky/ ^ " ^ ^ " ^> 

Houts: Let me a^kanother question. Do you 
think that It's morally defensible to inquire 
into such things as' social characteristics.? 
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Hoffman: Yes, I think^so. I t^ln^: that all 
sorts of thinjgs should be inquiye^ into. I 
don't ^hink you should set i^iaits on inquiry, ^ 
although I shouldn't say you should neyer set \i 
limits,. . . .But in the matter of social char- 
acteri^tics, I think we to investigate 

it very; very carefully.'. ^ .But if your mo- 
tivation is honestly ahd^ seriously scientific,' - ^ 
I think that the/ mote We know, on tfie whole, ^ 
thj5 better off^wQ dft^jl (p. /37) 

I have two argujnen|{s with Chomsky's reasoning.^ 
First, the denial of ac;^^s to research, whatever the j 
area, seems defensible ^tok^me on only one ground, thatV^ 
di^'ect harm ys done to the* subjects in the process *^St . 
carrying out the research. Second, it ia of social sig- 
nificance to conduct st4dies regarding individual and' 
group differences. . While it may never have been planned 
for such use', the aata available on various groups have 
been instrument^ in pointing out specific instances of 
social injustice, whether* it has involved blacks, Indi- 
an$, fema^e^s, males, Chicanos, or any other group. , 

^ Finally, Jencks, as I mentioned earlier, has writ- 
ten. ^ provocatiite book that enters into t^' discussion. 
He jfoiiftd, upon preanalysis oif Coleman's data, that dif- 
ferences in qoality of ^schooling have littie r^ejatio/iship 
to adult success; he"t^as<^fred that, since ^be*^ quality of 
schooling may not mat^iW^ *S0, nnuch, schools sHould be^de 
a "^«ri*^''place to be.' also found that familial* in- 
comes (that is, brothers and sisters)' varied almost as 
n^uch as incpmes in general. It was also his conclusion 
that 45 percent (not 80 percent) of intelligence scores , 
were duetto fter^dit.y. , In the eftd, Jencks seems to hav^J 
been placed in the Jensen caipp by th'e anti-Jensens, l^t 
ignored by the Jensen camp. '^•> ' a . ' 

What, then, might be made of the heredity-environ- 
ment issue as it relates to human intelligence? , 'In fair 
,ness to Jensen, he has meticulously followed standard 
practice in both relating his own research ajid relating^ 
the' research of others. Surely, much fault can be found 
not sp much with the research but with the dramatic way 
he attempted to refute compensatory educatioii|. More ^ 
important, it should pointed out that Jensen never 
intended to test the heredity-environment question as * 
an impartial judge; by utilizing the Genetics approach 
and ^attempting to estimate h^, the Ipritability index, 
the v^hdle focus is .od^eritability ./ Further, l-h^ is^ 
not to be interp4rete<^s environmental variance, singe, 
as Jensen states, "In bionietricaV genet ids , the environ- 
mental variance of o^^ is simplv^ the resfBual non-genetic 
variance, nothing more." (l97Sf, p. 173) While Jensen 
never explicitly cautions aga/nst the interpretation of 
hi*s data as a test of^ the horedity-envixonment issue, 
his methodology clearly shj^s him to be studying heredi- 
ty, not the latter. 

Perhaps psychometric tests have reached their 
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plateau in regard to the environment-heredity controver- 
sy. Clearly, better measuremOTts are needed for the 
concepts of "heredity*' and SAsnv^onment"; nor is it 
likely that psychometric device/ will provide them. Ge^ 
netic studies would ^eem to be more up*to that task. 
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A Genetic Approach to 
Intellectual Inheritance 



A second, and. potent i ally more fruitful, approach to 
hereditary diTfe^p^nces in intelligence is through a ge- 
netic, as oppo^d. to a totally psychometric, dpproach to ^ 
studying rnd^idusU 'and group differences. One such 
study that/itiiVht be seen as a rudimentary beginning was * 
conducte<J'by LehrRe (1972a). 

^ l^hi*ke revi-ewed several studies dealing with men- . 
tal xjetardat ion. One continuing, 'aspect of mental retar- - 
datji,pn IS that, i*n almost |Very reported -purvey , th^* 
maj^es predominate ovef females. Lehrke hypothesized 
tKat those genes^that are a major determiner of inte.l- 

Tiigence ar&'focated on tRe'T-cTTj^cjiiasome. OC-_l inkage of 
sipch traits 'would ac(?ount fdr the apparent greater mal^ 
variability. Th4s gre,a»ter variabiTity would result in 
proportionately higher incidenoe €>f both mental retarda-' 
tlon and of hi.gher intell igence .in males. Deleterious 
alleles of these genes could rdsult in mental retarda- ^ . 

*ti6n' that is transmitted as a ^ex-linked recessive. 

^ ' Cehrke interprefed the data- available as supporting 
foujr hypotheses; * > ^ 

» 

* Hypothesis 1. There are major^ genet ic loci re- 
latiT>g to human intellectual functioning that are 
located on the X chrornosome. , 



Hypothesis 2, These genes, if mutated, 'can lead 
to subnormal Intel iectual functioning, including 
mental -retardation, in a X-linked manner. 

Hypothesis 3, One or more, of these genes relates 
to verbal functioning. 

Hypothesis wl. rTh^ deficit re late's- primarily -tcr 
the central nervous system'. (p. 61'2), 

In. support of his hypotheses, he quoted severaJl sources 
that indi'^te the greater variability of male intefli- 
genee scores. Also, Reed and'Reed (1965) are quot<|| re- 
garding the incidence of known retardates: If ^he 
mother but not the father is <;retarded , the probability 
. ^of retardation in .the children is twice as great as in 
families where thd father is retard^ 'b^ut the mother is 
TYOt. ' ^ ^ 

Anastasi (1972) pointed'Out that insufficient cvi- 

t 
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deijce'is available to talk useful ly 'about a ;^theory'' of 
X-linkage tp majbr intellectual traits. She indicated-' 
.thkt the hyj)othesis wduld be useful ly ^tested by investi- 
gating several retai^ded in^ividu,als . Nance and Engle 
(1972) suggested that social reasons may exist fonthe 
jfredominancfe of males being'^'regarded -as retarded. They 
also statedA.in support of Lehrke, that '^Tl^re' i^^^jto ^ 
doubt that tftere ate many sex-litiked fetesslve traits 
thdtt are associated with mental retardation, and that in 
the aggregate these syndromes ^may constitute a greater 
proportion of the retarded population 'than rtio^t people 
are aware!'*'* (p. 625) ^ 

In response to Anastasi and Nance and Eoigle,, Lehrke 
(i972b) clarified several points , that they had raised. 
Wbile no further rejoinders w^e included in this^parti- 
cular is'sue of the American. Joui^l of tMental DeficieTfioy^ 
there could be more to Nance and Engle ^s conunent regard- 
ing how social mores -serve as a basis for decisions about 
institutionalizing retardates than Lehrke is willing to 
credit. Uhfortunately , documenting evi'dence of 'differen- 
tial- bases for admissior^to institutions for mental re- 
tardation is quite difficult, a^nd perhaps impossible in 
regard to making definitive judgments. ^ In my limited 
experience in six month's employment as an attendant' in 
a state mental institution, there seemed to be a behavior 
difference between ,maje and female patients. It appeareji 
Ih^t fenra^res are much^more relucfkntl^ consigneci/ to men- 
tal hospitals than is'^true for males. If'this tendency 
holds for other types of institutionalization, including 
mental retardation, then differences in proportions of 
males and fertiaies in institutions for the mentally re- 
tarded may well have a social *base. 



SEX DIFFERENCES IN MATHEMATfCAL TALENT^ 

Bearing on Lehrke* s hyj)othe5^s as it relates to 
higher intellectual funcwfcning is the research being con 
ducted at Johns Hopkins University Under the direction of 
Julian Stanley.^ In a large testing process preliminary 
to .their major st\idy, Keating and Stanley (1972) found 
an unexpected and disconcerting sex difference showing a 
preponderance of male$ w.ith higher mathematical talent/ 
Using the Sequential Aptitude Test Mathematical (SAT-M) 
a«'a screening device with .396 stftdents participating it\ 
a mathematics content (223 males, 173 female?), they^ 
found that those scoring 610 or over numbered 43 m^*es 
but, no females. On another test administered at|jme 

' same time. Mathematics Achievement Level I (M-U, 22 
males and no females scored 560 or above. ^ 

Astin (1974) sought to investigate 'these sex dif- 
ferences further, B#fh she and Anastasi (1974) point 
out that mathematics scores correlate substantially with 
pessonality characteristics most readily associated with 
masculinity (independence, nonconformity, and unconven- 

•tionai\ty). >ithin female groups, these correlations 



also hold! Astih piiffered several social explanations 
for the results: (1) Girls may become more anxious about 
competing than boys; C2)^ , role stereotyping fayors inter- 
esjf? in mathematics and science, for boysj^and (3) whereas 
''^irls , appeared to like school, highe'r scoring, males re-^^ 
ported a strohjg dislike for school, showing them to be 
more ^nonconforming. Whileu^Astin, did concede 'that a b.ip- 
logical explanation may accoujit-^or sex differences in ' 
scores, she feels the e\iSei\^e is more convincing in the 
jlirect^on of cultural reinfprcement of differences due to 
se^-role identification.' ^ 

• In opting for a iDore parsiinonioui' explanation, 
LehrJce might argue that gene differences by. sex are also 
related to , mathematical ralent. Stafford (1961) showed 
that spatial visual izati^on was sex-linked, with mal/^s 
scaring higjier'. Hartlage (1970)*also investigate^^'this 
same phenomenon with similar results. Bock and Koiakow- 
ski C1973) followed tip these two studies with additional 
(Tata of thefr. own. '* They found that in all three studies 
on spatial-visualization the correlations for father-l 
^daufehter and mother-son wete both higher ithan either \ 
father-son or mother-daughter. In all three studies, 
raa^es scored significantly higher than females. Of ad- 
ditional interest Was the finding that spatial-visualiza- 
tion appear? to b^ class-free, in the sense that lower, 
class .subjects spore as well as middl^ or-*upper clas& 
subjects! . The explanation of fered by Bock an^ Kolakowski 
is that spatial ability is subgtantiall)** influenced by a 
recessive sex- linked gene. / ' ^ 

As a final comment, /it is fair to say tjiat studies 
regarding genetic compon^ts to intelligence' from a bio- 
logical viewpoint ^e aft a beginning stafee, and are by / 
no^eans definitive Jm. th'eij^* results . On the other hand, 
psychometric investigations, wherein -all^ measures us^d 
follow a paper -peKcil or question-answer format, are less 
likely, over the^long fun, to be as convincing as a, re- 
search viewpoint that includes a biological examination 
of the heredixy compoitcint'. 
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Looking at Tests afid Items on 

Tests: Hoffman, ^visited \ 

^' 1 



.The. stock in trade of the test sellers is' the multipU- 
choice question. Students who call multiple-choice tests 
"imiltiple-gues3" tests are closer to the mark than they 
realize. Several writers have expounded on ^specific ' 
items on the various -Standardized t,ests . In the Jifll)^/ 
August 1975 issue principal^ Zacharias, Schwartz, 
Colle, and Butler ci'iticized specific items. By far the 
most interestittg exposure of tests and test makers Was 
done by Huffman (1962). * ' * \ , 

^ While the following item does not appear ^n any 
pi4?lished teit that I am aware' of, it is suggestive of 

the dilemma that face^' test takers: 

/<-'*^ ^ ' - ' 

' * I. 
, America was discovered by 

, (a) Christopher Columbus 

(b) Leif Er^ckson ^ ^ 

\(c) Welsh sailors 

(d) the Chinese 

(e) Indians 

If the student is told to pick the answer that!:' »» is most 
correct," a knowledgeable reader would indeed be per- 
plexed. There is at least some archeologic^l' evidence 
that would allow answers (a), (b) , (c) , and (d) to be 
correct; on the other hand, it is clear* that the Indians 
were already here to greet whomever arrived. Newer ar- 
cheological evidence may show that even the Indians were 
preceded by some other group. So the problem here is 
being clear about what is meant by the. word ''discovered.'* 
Assume for the moment that the keyed answer is 6*) . In- 
terestingly, high scorers might tend to get this ifem ^ 
"right" more often than fower scoreVs, giving this item 
a high "discriminant-oftflidit)^," and perhaps permit it to 
be a siirvivor of an "item an'alysis." Higher scorers 
might use a strategy somewhat like this: the item clear- 
ly does not have a right answer, so which answer ^ght 
appeal to a person who writes t>ests? While answerWfc) , 
(b), (c), and (d) might have sufficient evidence f^jf^ 
them to.be "discoverers," (b) , (c) , and (d) did hot es- 
'tablish their evidence widely -^o Caucasian Europeans. 
Also, the test writer-^probably uses a Caucasian point of 
view toward the term "discover," so that answer (e) ^ 
would be inadvisable; even though the Indians .obviously^ 
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.kn^w about_ America, they didn't tell us. In other words, 
high test scorers may have learned how to "get into the 
"Tlieads" of people who write questions, 

Sec(U'ences^of numbers are popular on several apti- 
tude tests. Consider the following item (Oti-s and Lenon, 
1967, p, 2) ; ' ' ^ 

^ f/hat term is missing in this , series? 

-. 3, 5, 7, 5^, 11,-13 ' ^ -(E, 9, 10, 14, 15) 

The^ elicited answer is, 9, but^mathematically any of the 
answers can be shown to be correct The answer 9 could be 
achieved by the fall^dwing, series : 2n +1,. The answer 10 
could be achieved ^by using:; 2n + 1 + (n-1) (n-2) (n-3) (n-S) 
(iv^)/12. To ^et 8, a similar series can be used: ' 2n ♦ 
lSi(«-l)(n-2)(n.3) (n-5){a-6)/12. To" get 14, the follow- 
ing^ series 11 wdrk: 2n i i. 4- -5{n-l) (n-2) (ri-3) (n-5) 
(n-6)/12. Finally, to get 15, the following may be used: 
~ 2n + 1 + (n-l)(n-2)(n-3)(ji.4JGn-6)/2, 

A worrisome que'stiop may enter the mind of the 
bright student: whije mathematically any of the«answers 

be correct, does the person who made out the test 
Icnow this? Lf the student'goW on the assumption that 
the test is really rather simpleminded (as, perhaps, is 
the test constructor), then the easiest answer'is 9, a ^ 
piece of reasoning that raises a criti*cal issue almost ' 
neve/ addressed either by test constructors or test 
» critics, — How ca n a person who is more intelligent, or < 
more knowle^eable, thai\ the test writer and/or adminis- 
trator be fairly tested by the test and/or administrator? 
^ In the absence of flexibility on ,the part of the person 
who scores a test, it may not always be possible, As' an 
interesting aside, I took' the WAIS from an examiner in 
1964, One of the questions on the WAIS is: "What is 
the population of the United States?" The answer given, 
192 million, while closely approximating the actual 
counted population, was technically incorrect going by 
the test manual, written in 1955. Full credit ytas given 
for answers between 140 and 180 million. The test manual 
was Wdai:ly outdated. Further, it is well known that 
the official population clearly underestimates lower in- 
come minority males between the ages of 18-30, The offi- 
clal Estimate could be off by- at least a few milli'dn. 
Knowledge of this pheno»«ion jnay be costly if the exami- 
ner is not iflexible, (The ^examiner in question iiXie' flex- 
ible. Later he said, "If you said it's 192 million, then 
it's probably 192 million. The test manua> is outdated 
anyway , ") \ / 

What happens when a person taking a t^t recognizes 
an error in the instrument? Jf the experience of the 
stiidents reported by Hoffman is typical, writing to a 
tfest compaily-is of little avail. Apparently they rarely 
respond to individuals writing to que^stion certain items; 
at least they rarely admit to error. While item analy- 
ses are not always particularly useful, if a "distrac- 
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tor" or plausibly incorfect answer isr picked more often 
than the keyed anSwer, it is not wholly i;npos^ible that'' 
a mistake has been made in the key- 

ON DISCRIMINANT VALIDITY, ITEM ANALYSIS, AND REPINING 
PSYCHOMETRIC. DEVICES \ ^ "^^ 

* Standardised tests typically go through a complex 
iteiii tryout period. One step includes finding the^dis-" 
criminant validity of art item. Typically, the percentage 
of people yShos-e total score pllaces them in the upper ^ 
fourth* are compared on each item to those whose total 
scor.e places them, in the lower fourth. Suppose 80 per- 
cent of thos^ in the upper fourth get an item correct^ 
but only '40 percent of thosp in the lower fourth get the 
iteffl correct. The discriminant validity is then .80 - 
.40 =,.40, a fairly respectable discriminant validity. 
Ah assumption made is that people wha get higher 'scores 
know^ more about the aiibject than those who get lower 
scores. What happens if the reverse is true: that is, 
lower scorers are more likely to get the item correct 
than those with higher scores? The item is likely to be 
dropped 'Or rewritten; even if a casual (I'.e., non- 
psychometric: expert) feader.' thinks it ' seems like a good 
\item, he i.s still unlikely to convince the psychometric 
expert of the validity of the item. 

Various other technical details go into construct- 
ing a psychometric device, including finding measures of 
reliability and validity, continual item tryout, etc. 
The non-expert should be advised that the words 'Reli- 
ability;* and "validity," jii*vthis context, lose their 
usual meanings, and instep take pn technical aspects 
that, beyond being bewildering, are highly misleading.* 
Measuring reliabiUty is similar to measuring intelli- 
.aence. Neither can be measured directly, so that hypo- 
tnetical constructs must suffice, . ' ^ 

After attest has been refined througl); item analysis 
^ final form of the test is developed." Th.e refining pro- 
cess is akin to processing wheat; the original product, 
yith minimum processing, might have l)een more fit' for hu- 
man consumption. ' ^ 




Use of intelligenee Testing 
in the Schools 



'Intelligence testing has a lengthy history of /use in the 
public school^,' but its use has never been mwre force- 
fully challenged than it is today. As mentipned earlier, 
in a recent issue of Principal^ Zacharias , ^Morrison, 
PurvijXji Jadil la and Garza, and Lazarus hav/ in varioias 
ways asked for either a moratorium or theiabolition of 
intelligence testing. Jn several large cities, including 
New York,"Los Angeles, and* Washington, intelligence test- 
ing has, dn fact, been abandoned^on a district-wide 
sca,le. 

The Jbasons usually ^iven in support of a morato- 
rium on such tests are either that they, reflect an East- 
ern, Caucasian, middle class bias and discriminate against 
minority groups, or that they are often used for labeling 
children, particularly as ;they are related to placing 
them in special classes. In. addition, teachers and ad- 
ministrators who ha^e access, to the scores of \these tests 
are also presumed to be guilty of so-tting in motion a 
"self-fulfilling 'prophecy" abOut the children. While I 
don*t necessarily disagree with "these arguments, I think 
some distinbtions have to made. , 

Much of the abu^e of intelligence testing is due, 
I submit, to tl^r uses in a negative sense. If* legist 
lation were passed so- that intelligence tests tould b6 
u^ed, only in a positive sense, then perhaps at least 
some of ^hose who wish eithet a moratorium" or abortion 
would be willitig to concede .-a real value if\ intelligence 
testing. Consider the alternative of abandoning stan- 
dardized te^l^ in' general and. intelligence tests sp€ci7 
fically. The stage would he set for statue .quo oriented 
teachers ani administrators to more. intensively act out 
their biases and reward coiifbrming s^tudents. ^ 
^ ' An incident that illustrates the ppint I am try- ^ 
ing to make occurred several year^ ago ip a Northern 
California town., Because a borderline normal female 
was also quite promiscuous, sfchool leaders tried to get 
her placed in an institution for tlfe retarded. /She was 
adm^hiistered a b^vy o^ indiv*i4ualized te$ts. Had she , 
scored below 70,' the school people could have proceeded 
with the placement. Curiously, her scores on these 
tests varied between 71-^75, fiever below 70. To me, the 
incident points out sev««9*l things. The school offi- 
cials tried to use the intelligence t-ests in a negative 
sense; 'they were offerjded by the girl *s behavior and ' 
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wanted to remove her from the iJommimity under tlft .gyise 
that she was retarded. The tests, however, ^ere used in 
a positive, sense; the girl 's*scores having ^xceeded*'the^ 
minimuro, scho.dl officials could not remove her from the . 
community.^ Hati they'had a free haj^, they wbuld hav^e un- 
doubtedl)^ sen^ her to an institution for the mentadly re- 
tarded juit to get he'r.out of their community. . 

Likewis^ intelligence %ests have been said to dis- 
ci*iminate againsY'TffiTitJritie? , ^ chaFfe thft hasQJonsider- 
able substantiati^fam. Consider, however, a black student 
who '^ichieves a score of 140 on the.WAfS; that s^re, in 
itself, would seem prima fdcie evidence of- .superior in- 
telligence-, regardless #)f the opinions held*of this stu- 
dent by the student 's' teachers . To discontinue intelli- 
gence testing would seem to be* a discriminating act a- 
against bright^inority students.. If the rights of a ' 
child are to be considered, then the child's right to^ v 
take a test is at least 'ks great as the child's right not 
to take a •st . • 

The statement in^ the previous patagraph is predica- 
ted on theri being some positive contingencies connected- 
to takihg a testT^ For^ example, schools mdght'have made 
special provisions for student^ who ^core above a pairti- 
culaf score. If, on the other hand,* the only contin- 
gencies of tigsting are. negative, or if the tests are 
given purely for administrative reasons, then I would not 
try to. defend the testing process. 

The legislation I have in mind might require a 
school district to forego placing a child tin a special 
education classroom if he achieved above a given score. 
Nor would the school be allowed to placTe the child in 
the special classroom siihply because he had failed to 
acl^ieve tfie specified score. If the interests of the 
child yere pa'ranvount and clearly supported by law, a 
^ Variec^ group of professionals--including, perhaps, a phy- 
siCT^ar^a school counselor-, a clinical psychologist, the 
tJWl^er, and a ^speech ther^pist^-might be called on to 
assess aspects of the child^S growth, in consultation 
with the panonts and the child. This group would then 
be in a position to make a more judicious choice of help- 
ing the child toward maximizing his learnipg opportuni-- 
ties. If this process .seems to the reader to be a lot- 
tery whete y6u win sometimes, but n^ver lose, consider 
that the purpose of public education is the provisiop o 
meaningful learning experiences for our nation's yoyt)r', 
not the employment of middle class j5<lreauoTatic functiojl- 
aries. Besides, what is wrong wifn ^1 losing ,more^ Stu- 
dents the experience of winning? 

A point should be made here about the history of 
intelligence testing. As Kamin (1975) has pointed out, 
this history includes some unsavory iForef atfiers . While 
Binet was an important exception, _many of thq other 
leaders in the development of intelligeace testing (in- 
cluding Terman, Yerkes arfd Goddard) were proponents of a 
eugerlTc^ view ^ following the pclitlxral minds of the day 
(i9lJ-192S). Concurrent with the construction of early 
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intelligence tests / Spearman developed the 'concept of 
fg^ctor analysis, princip'ally as an aid to understanding 
intelligence. Large-scale jiAlligence te'sting was con- 
ducted on new% recruits duri^Wdrld War- I, usiiig^the 
"alpha" test^,- Intellig'ence testing was thus^ given sever- 
al inc0nti>ies''''for advancement de^;spite the bias of several 
pf the early devel<ij)ei^ r' In the process,, intelligence 
testing has rec'eived the gVeatest attention of the vari- 
ous psychometric devices used,, and as- such", the psycho- 
metric quialitie5''of the more widely distributed intelli- 
gence tests ar# perhaps the most advanced examples in 
the art of psychom'etric testing. If intellige.nce tests 
are of dubious value, then where does this place other 
psychometr,ic testing, siich as achievement Resting, per- 
sonality testing oV attitkdinal ^testing? .If intelligence 
* t;ests are tjo Se -eliminated, then feV tests af any type 
"ifould seem to be free fi^om a successful cjjallejigfe from 
some interest group. 

It seems bothersome -to many crit^^ of intelligence 
testing that the concept' of intelligence is not "pinned 
down" to an acceptable concrete entity^ but rather is to 
some degree seen as an abstraction (or hypothetical con- 
struct) that is measured indirect-ly through the various, 
tests, however inadequate. Hqwever bothersome the use of 
an abstraction is to the criti9S of intelligence testing, 
it. should be pointed out that, with few (if any) excep- 
tions, oth^r personality ^nd/or attitudinal measures are 
even- less well ancho-red'to a concrete reality than is 
intelligence. Those who w^uld argue 
of intelligencje tests because they, 
rectjy) "intelligence" wouTd seem. al__ _ . 
jthe use of attitudinal tests-^^uch as the' measure of au- 
thoritarianism ^<ievi-sed by Adorno, . ^ 

Adprno and others (19^50) instituted an. F scaie*tx? 
test for rigidity of thinking and agreement with an ultta 
right (fascistic) viewpoint. One Idgical crit^ism that ^ 
hks been raised is that this kind of test is ^ insensitive 
to discovering nfembers o.f^ the ultta-left who, are, if atiy- 
Vthing, even more rigid; Tapp (1975)" has termed these 
people "totalitarian liberal^." Rather than discard the 
F scale because it h^ been shown to be' inadequate in 
•some circumstapces , it makes more sense empfirically to ^ 
note itS limitatiofis ,' however numerous* bu^ to allow its- 
continued^ use „/ot the sake of eventually bptter* under- , 
standing the.h'uman condition. ^ * ^ 

Should intelligence testrng'be aban3oned?- While , 
'any tasting can lead to abuse, their posilt^Ve dse seems 

^ ite thesLjM^tlc 



too impoi;t^t ^o. simply elilninate th ^SuJalt the long 
run, society would be the^loser. \ , 



w^uld argue agaii^k fhe use p 
ause they^den't irofsure (di- ^ » 
uTd seem, also to arfeu^ against 
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Use oj Standardized Achievement 
Tests in the Schools 



Consider for a moment an al 1 -too-frequent type of testing 
abuse. A school 'district with perhaps 10,000 students 
administers to each" student each year a complete 'test 
battery such as the Iowa Tests of Basic Skills or .the 
California Achievement Tests. If the district employs 
500 teachers, T:hen the 500 teachers spend the equivalent 
of a full professional day each administering the te^ts, 
and the 10,000 student's spend the same time tak^^ig the 
tests. Consider the costs. First, the test forms and 
answer ' sheets have to be purchased. This outlay could 
easily run several thousand dollars. The tests have to 
be scored; if the teachers have to do the ^coting, they 
•are likely to spend another 20 to 40 hours in the drudg- 
ery Qf test correcting. If the tests are'scordd by opti- 
cal scanner, the cost is likely to run at least a few 
thousand dollars. The teacher's time, if prorated at $50 
a day., comes to $25,000.. If the students* time is worth 
anythi-TTg (which it very well shoul<^ be), the students^ 
have collectively spent 60,000 to 80,000 hours taking 
tests. 'At $2 an hour, this "expense" runs at 1-^st 
$125,000. hveR if this "cost" is disi^'egarded i a^ i,s gen- 
erally the gase^/^other student cosO shoiild bi understood. 
Many students can, see no relevance to spending so much 
time taking tests. Further, a negative affec^ toj^ard- 
• test-taking is' easily envisioned in'tfi'is massive lasting - 
effort. Giving- the tests (as they are usu^ly adminis- 
tered) in massive doses, fatigue must s^lrely play a part 
in students' scores. Considering t^ ^achers again, 
the boredom that ensues from admir^tering tests hour af- 
'ter hour i<> taxing>;-*if the teacher^ is highly motivated 
to enhance a child's learning experiences, administering 
standardized .tests is extremely frustrating. 

What are the "payoff^'** from this massive testing 
effort? In the best possible world, the school district 
eihploys 20 to 25 testing-subject matter specialists, who 
are able -to £ran^i%te tjpst results into remedial and/or 
accelerated j^rogiafc foV the individual student in contin- 
uous consulI:atidriiwi\:h th^ classroom teacher. These 
testing-subject matter experts'*<lcv not now exist in any 
number sufficient for present testing programs; their 
non-existence is ij^some measure due to, the school dis- 
trict's unwillingness to expend monies for the necessary 
number of specialists ^to implement such "a prograip'. 

More likely, the school district will use th^ 
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scores to arrive a^t some normatiye data on comparative 
achievement levels; if the data is embarrassing, suppres- 
sion of 'the information is. all too "common. " 

Teachers are not now likely to looR closely at 
achievement test scdre^. All too often, the test scores .y 
sit in files and are only occasionally consulted, if a 
parent or counseloi: or other agency insists on the inf<Sr- 
mation. Schools rarely Embark upon any systematic usage 
of test information to enhafiC^tudent learning experi- 
ences . « 

If the costs and probable usage are compared, there 
is no justification for massive standardized testing. If 
finding nojnnative data to compare the school district to 
a national scale is the de^re, then it can be achieved 
without this ^enormous human cost. If students w^re ran- 
domly chosen, 50 each at grade levels 3, 5, 7, 9, and 11, 
so that a total of 250 students wouid be tested, the 
same usage could ensue and at a greatly reduced cost, 
both in actual dollars expei)d^and in wasted time. If 
a moratorium is in^order, that moratorium should consid- 
er mindless massive testing programs that exhibit llrttle 
payoff for the costs involved. ^ 

^ In fairness to the test constructors (of, at lea^t 
the Iowa Teets of Basic Skills), dt sh6uld be said that 
the tests themselves are often technically sound; pro- ^ 
perly ,used (with a sample of the students rather thap 
the whole population),, they may yield useful information, 
improper use, like overeating, ha^ undesirable side 
eff.ects . 
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Why Do Schools Administer 
Standardized Tests? * 
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Why jdD schools administer standardized tests? If the 
qu^tion is asked of students, a plethora of answers may 
be given. One answer that probably does not occur very 
often is,, 'To help us where we have problems.". If a 
teacher is asked this question,^ a likely response is, "My 
principal 'forces all teachers to give them." Central of- 
fice personnel may either res^ftrt to the "district policy** 
^answer, or perhaps Indicate that the "pub^lic" expects the 
schools to be accountable. A most unlikely answer is, 
"Because it enhances the learning^activities in all 
classrooms." If Perrone's ^'1975) experience is typical 
(and there seems to*<| no contrary evidence), then the 
scenario just described may ^e apt: 

As this eventful Week came to a close, I called 
an assistant principal at a junior high school in 
yet another school distVict, opening the conver- 
sation with, "How are things going?"^ His re- 
sponse was not what I had anticipated. 

-^'Terrible!" he exx:laimed, "We gave the 

test on Monday and Tuesday, and the school year 
. hasn!.t settled down. yet.. It's like this e^ery ^ 
school year." 

"Why do you give the test?" I asked, but I ^ , ^ 
knew why: *it was a systemwide activity.. Then I 
asked vhat -was done with. the test result!^, and 
the assi9^An,t principal said, "We just file them ^ 
away; no one looks at tbem." (Perroney 1975, p. '97) 

In other words, for so many, the reason for giving 
standardized tests this year is because/they gave them 
last y^ear, and. for every p'revious^ schrol ye^r in anyone'^s 
memory. To discont inue the^ standardised testing program 
(if this is' seen as TTea^oned soluiioft) would mean rock- 
ing the boat-^-and rockirig fhe boar is almost an extinct^ 
behavior among school administraftors. 

But how* should \he schocHs replace the standardised 
tests? First of all/ it sjniuld free up the time previ- 
ously lised for testilig fer tjje teachers to use in 'more 
meaningful activities^With^ the students, ^^^^^ money 
alloca^d for test-j^ving cannot be shifted to pther pre- 
viously .negleotejKareas, then perjiaps .some tar reductions 
might occur, » - ' 

^ If ma^ve standardized testing is to be abandoned 



in a given school district, it should be pointed' ouf that 
the process of , abandonment is perhaps even more important 
than t^ outcome. Serious and reasonable' discussions at 
all levels are necessary. Questions have to be raised, • 
such as: "Why do we give (take) these test??"; "How 
^migh.t. thfe test scores be meaningfully employed by the 
student, the^ teacher , the parent , the administrator, or 
the researcher?"' we meaningfully use the test?"; , 

"Is the test worth'*What it cosfs (in.dollars and in 
time)?"; "If we have a standardized testing program, is 
it mandatory^ or could^ classes (stijdfeiits) elect to parti,- 
cipate or not to participate?"; "kew look at the tests; 
do they te^t* relevant areas?"; ^Are fnetestfethe most 
economical way , to sample necessary instructtoiyal areas, 
or are other testing forms (such as problem solving) more 
relevant?" ^ "V / 

If all who*are i^olv^ with the testirig area r^a- ^ 
son together, then whoever their reasoned, decision, that 
they have -wrestled with the necessary questions and ar- 
rived at a decision is ^fficienty Even if they .^decide ^ 
to continue the »^ndar(^ized testing program in some 
form, they are more lijcely to understand the rationale, 
for testings and are more likely ta make use of the re^ 
suits ' ^ 



po We Need This Big an 
Educational Testing Industry ? 



The dimensions of the educational testing industry are 
irartense; Kohn (1975) has estimated its* yearly volume at 
$150 million. Corporations such as ^he Education Test- 
ing Service (ETS), Psychological Corporation, American 
Colle^ Testing Service, and the California Test fitireau 
of McGraw-Hill ^re fairly widely known. Textbook pub- 
-lishers^are also heavily invested in the testing indus- 
try; Houghton-Mifflin, tor one, publishes Icwa Teste 
of fasia Skills, the Lovge^Thorndike Intelligence Teste; 
and the Stanford^Binet Intelligence Scale. ^ These cor- 
porations are byt the tip of the iceberg* There is a 
startling array of published tests available to ^est 
every conceivable aspect ef human endeavor. Buros (1974) 
lists more 'than 1130 pubU^hed^roup intelligence tests; 
more than 20 individual tests are available. The tptj^l 
number of-^published tests available easily numbers- in 
the thousands^ (Buros, 1972, lists 2,585 tests). 

(tore important than the number of tests is the ef- 
fect testing has on humaA lives. Students typic^ly have 
, to .take (afid pay for) a college admissions test even 
while they are in high sbhool. Companies such as ETS 
have developed a •whole battery of achievement tests th^t 
are now accepted for college^ credit ^ the student, scores 
high enough. It is now possible for -a student to achieve 
one year's credit at a designated college or university 
in'^one day's effqrt^^ki^g EtS's advai\eed placement tests 
before the student has even set foot on the campus. 

I.f the student wants to go to graduate school, then 
ETS's Graduate Record Examinations (GRE) are usually rfe- 
quired. And, pity the student in a competiTive academic . 
area who may have excellent undergraduate grades and ex- 
cellent recommendations but only mediocre GRE's. , Simi- 
larly, t^ere are tests for almost ^ery professional 
school, but surely including law and medicine^ 

If the* studeAt rteeds^ a loan^ ag^in ETS comes to the 
resqlie (at $3.75, thank you). The student fills out an 
optical scanner shjeet, and i^omehow this is better than 
showipgi need to lo<iaI officials. 

Suppose the student 4ias to show. a reading knowledge 
of a foreign langua|e for Jthe Ph.D. Do the^ professors 
in the appropriate foreign language test the student? 
Not on your life. ETS cbmes to tfie rescile again. 

One of the funnier incidents in my professional 
experience is due ^o the uSe' of the ETS French Examina- 
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tion. The graduate schoof where I ' taught decreed that a 
student should score at the 33rd percentile (or higher) 
to show a reading knowledge of the language. Thei« were 
three forms of th^test, one for students in the sci- ^ 
ences, one for students in the humanities, and one for 
students in the social sciences. ' However, the first part 
of all threfe'tests was identical. Because students in 
the humanities and social sciences naturally^have a . 
greater language backgrour\d, their scores are usually 
higher. Thus getting, a score of the 33rd percentile is 
much harder for students in these areas than it is for 
those .in the sciences.* As it happened, one could pass 
the science test *by doing x)nly the grammar sectipn and ^ 
• omitting the rest of the test; such was not possible for 
the other two tests , In fact, s^ey,eral sfudents in the 
humanities ^and social sciences section "failed" the lan- 
.guage examination even* though they had obtained more than 
a hundred points in excess of the score- required to pass 
the science examination. As an alternative strategy, I f 
suggested to several affected*^ students that they take the 
science examination and do the best they could on the 
grammar settion. 'Uftryou've done well enough,'^ I said^ 
'-'you don't even have to look at the science i^eading sec- 
tion of the test." By hook or c^ook, students started 
passing the language' examinatron significantly moi'e often. 
• _ One additional aspect of the testing industry that 
should be considered is its secrecy. Kohn (1975), in an^ 
attempt to gather information about the testing industry, ' 
talked to representat ives ^from the major testing firms. 
None of the people he talked to was willing or able to 
estimate the nmnber'of students tested each year in the 
United States. They also would not reveal how many tests , 
their individual companies had sold or scored. The. only 
comment m3de in regard to overall size of the testing 
industry intimated thatf-it was . too' smaHj and should nec- 
■ essari ly;.become larger, ♦ 

A short reflection on the amount^ of autonomy given 
to the industry by the colleges and universities is noth- 
ing short of astonishing. One wonders what, will ^be the 
next coup that the testing findustry will mpfce. Are they 
really very *far from offering dWreeaT? "It/has been said, 
in fact, and only half in jest,-;that the largest univer- ^ 
sity in the world is none other-^than ETS./ Siireiy, more 
college credits are achieved by taki^ng E^S tests' .than any 
one university offers. • * ^ ^ 

One of the more eloquent arguments ^gainst the 
testing industry was made by Karier (1972). In it, he 
indicated the testing industry as being the servant of 
grower, privilege, and status, A parallel can be drawn 
^ with the large philanthropic foundations. Tlie role of 
the foundations (tiarnegie. Ford, and Rockefe^liejr) , as a 
"fourth" branch of government, is said tb be that of a 
maitipulator of educational thinking for the coT\porate • 
liberal state. The pTrojept^ of the foundat ions been 
a pressure valve for society; studies^ ultimatelyT "prove*' 
the system is^workirtg, but needs some adjustment^ As an 
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example of how tfi^d corporate state has attempted to in- 
fffuence the direction of public education through massive 
testing, Kariex cites evidence regarding the testing done y 
with \he national assessment program: The establishment 
of the national assessiy^nt- is seen %^ a verx 'ief ifrfte 
step -toward a national curriculum. ^ 

* A similar view Is given by Kamin, (1974,'^ 1975) . Ka- 

min reviews the history of the development of intelli- 
gence tes.ting and notes that, w^th the exception of Bi- 
get, many of the earlier developers of intelligence test-' 
ing had a decidedly hereditarian view of intelligence. 
This heredi^hrian view was, in- turn, made to serve the 
politics or those who wanted to impose differential im- 

* migration quotas on the so-called '^genetically inferior" 
peo]^les'of the South Mediterranean and Eastern European 
areas, * " 

Should so large and intrusive a testing industry 
be encouraged to continue? Surely the testing bureaus 
would answer not only in the affirmative, but with rea- 
sons for. why the testing industry should and will get 
larger. But another ascendant point* of view, a point of 
view of students and faculty, is that we don'»t need one- 
tenth of the ^'services" testing firms furnish. Many of 
the services do no more than help enforce the status quo. * 
What, possibly could be more indicative of enforced sub- 
missiveness than for a tl^ird grade child to sit. for six 
hours taking tests whose purpose no one seems able to 
understand? The teacher doesn't have the authority to 
excuse th^ child; the principal may also feel the lack of 
authority. Higher up the chain, they are so far removed 
that the anguish of the individual child is quietly 
laughed off. What function do such experiences have in 
a democratic society? , *- 

If the testing industry .grew "j^ust like^ Topsy,*' 

Nthen is it not time to grea|j;ly reduce the size of the ' 
Breed? . ^ ' 
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An Interim Solution: Teaching 
^Students to Take Tests ' 



Even if those irx the testing industry,, on their own, 
'were suddenly to reverse their growth and retrench to a 
former shape, it is likely that their tests would be. 
with us for a long time to come. In any case, what's 
more probable is that the immediate future will bring^ 
continuing growth. Thus, students should be prepared 
for the testing onslaught. 

One example of preparing^ students for tests was 
given in the previous chapter. Perhaps where such coach- 
ing is most necessary is with younger students who have 
to endure the Ivwa Teats pf Basic Skills or the Califor- 
nia Achievement Tests* I am not implying coaching in 
the illegal sense, as in tf*^ Turnkey project in Texarkana 
in the earl^ 1970s, where a "large-$cale performance- 
contracting experiment was accused of using actual test 
items in their instruction prior to the scheduled testing 
Rather,, preparation, much as a baseball coach gets a team 
ready for a game, is helpful. , Students should no£ have 
to encounter a new testing form when thie' tests are^taken^ 
That is somewhat like a Little ,League player going to 
bat, striking out without taking the cat off his shoul- 
ders, and then remarking, "Gee, I never saw a left-handed 
pitcher before!" Students should be allowed some experi- 
ence with the multiple-choice format and its ludicrous 
idipsyncracies . They should know about time limit^^ and 
whether or not there is a penalty for guessing. If /there 
IS no penalty for guessing, then an answer should be giv- 
tn, for every item, even if the question has- not be,eti read 
Arnd even if there is a. penality, the child should know 
that 'he is better off (in terms of the score) sif a guess 
is made, if ajiything at all is known dbout the area being 
tested. As an example, suppose a question has four an- - 
swers and the child 'knows that one of the answers is 
wrong, but the other three are possi|?ly correct. Because 
the likely penalty for guessing is C - 1/41 (Correct - 
1/4 X Incorrect), knowing that one answer is incorrect 
moves the child's^ probability of getting a'positive score 
on this item above zero: * probability = 1/3 - 1/4 = 1/12. 
While this Value is only Slightly higher than zero,'"be- 
cause it is above zero, -'the score is maximized if a guess 
is made. 

Some of the coaching I have in jnind even makes me 
cringe;' however, the point is this: if students are to 
maxim,ize theijr scores, they do need some battlefield ex-. 
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pe?ience. Thus, the teacher is encouraged to , write some 
ambiguous items similar tq those found on tests ♦and let 

^the students cut their teeth on a miniature Version of 
"multiple guess"; this is somewhat akin to telling the 
players to get ready for sonje bad umpiring. 

If these suggestions seem unrelated to a child's 
educational experience, consider that the same could be 
fsaid for the festing. The suggestions clOarlx have noth- 

^ing to do with actuaT learning; their value is survival 
in the testing jungle. ''^ . - • ' 

But is is important, too, to go beyond the negative - ^ 
aspects of survivorship to where positive recommendations / 
can be considered for creating a clearing in that jungle. 
School districts, for one thing, should be held account- / 
able for their testing programs. Perhaps some of the 
tests ean pass the acid test; many^ would not.' Purely, 

. district-wide massive testing woOld serve as a useful 
*t^rget for accountability and possible litigation. A 
scKool district should •have to show a ledger giving the 
coj,^ an4 benefits fdr the various tests usedt Costs ^ * 
must #urply include feach^r and student time. Fo;:.thos^' 
tests that do not have a favorable payoff relative to the 
cost, changes should be mandated. 

Second, it is clear thaj^istciet-wide^^use of in- 
dividuafized intelligence tests is\too expensive in time 
and money; giving' tne tests on a selected basis may be 
useful, however. Suppose a black mal-e, child is being' 
considered for special e^«fC^tion on the basis of teacher 
recommendations, test sifores and whatever else go^s into 
the decision. The ehijd ajiould have the right to'take an' 
intelligence test undet circumstances that maximize his 
score. For example, a black, male examiner Who is able^ 
to elicit the child's best effort would be a minimum. 
Th^ test (pt test's) chosen should as nearly as possible 
reflect experiences that'would normally be available to 
him. If under these maximized -conditions the chdld does 
iW fact "pass" the, test , the learning , experiences, should 
be structured so that the^c^ilci might be integrated as 
qiickly as possible into a. "regular" classroom setting. 
/ ^One other word of advice should be followed by ^ ' 

£^yone who has any coi^tact with ^ny kind of test score. 
Before interpreting any score, the person should first 
take the test under the same conditions as do students. , 
If little value is seen in the test, then the same Value 
can be attributed to a test score. 
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A More Lasting Solution : 
Constructing Locally -^Jseful 

r 

Criterion-Referenced Tests 



For most learning Activities, it would seem that the most 
logical people to write the tests would be those who are 
involved in thpra. Thus teachers, ibtally-eraployed test 
specialists (if , aiiy)\ and students might all have a hand 
in test construction. * ♦ 

Implicit in the proce^ss is a placement of stronger 
responsibility^pon the teachers and students than typi- 
cally has been given to them. A*fi^st step is the deci- 
sion, as to the material to be learned^ How is the mate- 
rial .Import ant? What are the specific goals that need to 
be met? After having completed a particular course or 
uriit, what ''survival value*' has the learning experience? 
That is, what is it, either i^-the subject matter or in 
the method of learning, »that will be useful now or later 
on? How and wl1e"re will it be useful? 

The ki nd s o f questions implied above are -somewhat 
similar to the concerns of those who have emphasized be- 
havioral' objectives (such as Mager, 1962) or criterion- 
referenced tests (such as Popham, 1975); more simply, 
they are questions thkt a thinking teacher is likely to 
ask in regard to a learning experience. 

Also,' those involved with test Construction would 
be advised to become somewhat more knowledgeable in tes 
construction. While attendance in university courses 
would sometimes be helpful, test construction courses 
tend to emphasize imjiltiple-choice items' to the neglect of 
other areas.' Multiple-choice items have sever^ draw* 
backs. First, from the viewpoint of the item writer, it 
is extremely' time-consuming to write multiple-choice 
tests. Second, and more important, the multiple-choice 
format is not always a useful manner to test a person's 
understanding of an area: If, for exampj^ the intent of^ 
4i course is ^o teach, the students to write short essays/ 
it would appear that the testing situatibn would have the 
student write short essays. 

"While allowing students to **test out" of a course- , 
for credit seems justified, it is ironic that universi- 
ties sO readily accept "credits" from ETS and yet Often 
deny students the opportunity to "test,, out" of a course 
by taking an examination wriften by local people. Per- 
haps imiversities have not , been re^pOnsi^^enough to the 
needs of students in, regard to .credit by examination. 
Given the prod by ETS, those institutions that* fail to 
respond to student Tieed by developing tests locally are 
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accepting an even bigger intrusion by ETS in their future. 

What is: the place of standardized te3ting in such a 
scheme? I^the test exactly fits the goals of the leam->„ 
ing experie^ice, then of course the standardized test is 
appropriate,' This might.be particularly true- in skiTl 
courses such as typewriting or in courses based on a na- 
tional- science, curriculum. 

Perhaps the most notable changes implied are that 
fewer tests would be given, and the tests that are* given 
can be directly related to actual learning goals. Teach- 
ers have been making but tests for almost as long as 
there have been teachers, If teachers are giv^ appropri- 
ate instruction in evaluating the learning activities of 
students, it would seem that their teaching activities 
.could be enhanced. Foi; that mlitter, Students should be 
instructed by teachers,^i« tflfe art of making tests. Among 
the outcomes bf* attempting to compose such questions, 
students should be able to better understand their learn- 
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An Alternative -Solution : 
Constructing Criterion -Referenced 
Work Samples. 



Perhaps a most useful solution to tbe testing problem 
the construction of criterion-referenced work samples. " 
What, exactly, should students be able to do when they 
complete a unit ot a course? The testing should elicit 
the same behavior as the stated goal. If the goal is to' 
have the individual student master the use of laboratory ' 
equipment, then a multiple-choice test will rarely yield, 
a satisfactory measure of that criterion skill. 

One perlexing 'quality of some standardised tests ' 
IS that students who have not taken the course supposedly 
measured by the test do as well or better on the^examina- 
tion'as do studerits who have taken the course. This hap- 
pens particularly in housekeeping--re lated courses, such 
as consumer economics, offered in many high 'school cur- 
ricula. Several explanations could be offered, but a 
rather straightforward one is that the material being 
tested is learned in many sources besides classrooms. 
Watching television, reading newspaper columns such as 
Sylvia Porter^s, visiting tfie local supermarket-, reading 
advertisements, and related activities that are oft-en a- 
vailable to many people beyond those in a course iri con- 
sumer eccJnomics may well be sufficient to demonstrate 
competency. Rather than trying to devise tests that al- 
low those who take the course to demonstrate higf^^r pro- 
ficiency, it seemSHiseful to point out that people. who 
9o the activities necessary to score high or) a test are 
performing exactly as students might who took a course 
in consumer economics. The morfe advisable solution is ' 
to accept the .time spent watchiilg ^re^evant televisioyr 
programs, ^ reading economics columns, and going to the 
supermarket (which in'^educationese is called a field 
trip) as anr alternative learning experience, 

AN EXAMPLE OF USING WORK SAMPLES ' 

the , supposed reasons for requiring graduate stu- 
dents to take courses in -applied statistics is so that 
tKe student later will be able to apply »what has been 
learned in their own research. It would seem reasonable 
then to construct the course in such a way that, the stu* 
dent might, as' an end product,' show competence in execut- 
ing a research project, even though the dimensi6ns of , 
that project may be necessatily limited due to the con- 




straints of time, experience, and the availability of re- 
s6^arch subjects. In my experience, such constraints have 
not beep nearly as important as INwould have expected; 
students often have better access to researcHable-data 
than many o*f ns would guess. ' ^ 

To take the ^oal one step, further , ^ one of the stat- 
ed reasons for conducting graduate research is the publi- 
cation (at least in part) of the research conducted. 
While, on the one hand, this clearly would be an indefen- 
sible goal to require of all stS*lents in a statistics 
course, on the other band, usin^g ^ research journal *style 
has several payoffs in terms ^^the expectancies of grad- 
uate school,. To master a res^mh iournal style, it is 
useful first to have read severa^lWier articles^ in onels 
own field. Because the students are becoming familiar 
with the- state of research in their area of interest, * 
they can personally evaluate the usefulness of their own 
effort. Al^o, some students do, in fact, publish their 
papers eventually. 

Note that nothing has been said about typical 
classroom tests. Indeed, it may'be^hat tests may inter- 
fere jwith the students* and i»(structor 's goals, rather 
than enhance them. It is prdbably true that students us- 
ing the approach just described learn in great detail on- 
ly thase statistical technitme^ appropriate to their spe- 
cific project., .In many cas^, they will be learning sta- 
tistical techfnique? jpt ev6n\covered irv the main body, of 
the course. But then, isn't that what learning research 
techniques is all about? Does it not make sense to learn 
about a techa^ue, in detail^, as an application presents 
^4tself ,* whetfter or not it is required in a formal re-- 
search course? Such a teaching method is neither parti- 
cularly new (a previous article appeared in Williams, 
1970) nor unique; Noyick^and Jackson (1974)-have also 
been using a similar technique in teaching Bayesian sta- 
tistical research methods. 

CHOOSING GOALS " f . ' ' 

Clearly knowing one»s goals in teaching or direct^ 
ing any ^earning exjperience is a first step toward con- 
structing criteriori-reff renced work samples. Perhaps 
helpful' in delineating those goals is Maget 's ' (1973) - 
book.* Having chosdn the goals, the use of a circuitous' 
route through standardized testing would seldom make 
sense. Surely, tests might b'e used in a supimative sense 
to' ensure that skills haVe been acquire^.' But acquir^gg 
skills '^is rarely ai sufficient goalmn cpgnitive-oriehtea 
learning experiences^ rather the nmlization of, those 
acquired skills--mastery--seems to make more sensob* " 

A T}x>te of* caution is- in order. Edycators should*"* 
be, cognizant of different teaming goals evert within an* 
extremely homogeneous classroom. Not a]ll students are 
prepared for a wprk sample approach to evaluation. In 
,that matching student learning' styles with' their actual 




learning experiences' more important than the utiliza- 
tion (h any single teaehing strategy, it is well to ,re- 
membefr th'ajt sojtte sttidepts are so oriented to the tradi- 
tiojRil classrooA -procedure that, at "least for such stu- 
dents , le^jjjjing expediences in a traSitional foa^nat 
^D^ut graJiiunJy moving* toward a m(Jfe open format^ may . 
maximizQ their present l^rqing experiences . 
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What, tben, is this third view of educational testing? 
If -the firs tr two, views are seen as, being nearly Aamet- 
rically opposed to one another, one advocating a continu-* 
ation (and perhaps §n increase) in the present reliance' 
on.stafndardizjd testing (and perhaps testing in general), 
and the seto|P view favoring the .discontinuation of stan- 
dardized and i-ntoj^gence testing, ^Jien the third view 
advocated in this f^er might be se'en as being somewhere - 
between thos% two extremes • ^ 

The use of standardized tests has tended" to make 
tq'^chers functfonaries in the decisa^n^makin^#ocess re- 
garding t)te/outcomes of the educative proces^^^ th^^ 
standardizfed^^^fee^ is seen as "the criterio«/''^R>vaffve 
educational practices ar$ dAscoUtaged unless tq^ can ^ 
show some competitive ledge on "the* criterion.*' This 
tends to block the development of alternaJtrte goals. 
Rather than^help the teacher become .a fully functioning/* 
professional, and more, proficient in measuring stu^ent'^ 
progress-, the job of testing is given over to the "ex^* 
perts*" That is not tg deny that educational testing ex- 
pertise is a valuable commodity, but it shquld be npre ^ ^ 
widely diffuse^'Smong those actually involved in|^he 
teaching-learning process, Inherent^in this^ criticism 
is a plea for alternative .ways ot measuring goal attain/- ' 
ment. Hpwever useful the multiple-choice format is, 
other mea;3ures of expr^s«ing goal attainment^'have been J 
-'neglected to our detriment . r- " J^' 

Perhaps the most damning criticism of those in the^, 
testing movement isN^t that they dpn't have^ a useful 
product, but that their product ^s.'practicaT use becomes ^ * 
an abuse. To require every student in a school setting* 
*to speiTd six or eight hours on an examination^ seems in- 
defensible'. The incorrect interpretation of test re- 
sults, particularly as 'it applies 1?d intelligence" tests^v 
i^^also indefensible^* On tfie other hand, the abolishment^ , 
of the tesfing movement, as is^ sojneti^^iie^su^ujsted by 
'its detrag|p^s, sells ill-advised. Tests do. provide us 
with anomer perspective. • It is^ot particularly botberj| 
some to he^r someone remark about an intelligence test, v 
"In that this sfudent is a bilingual Chicdho, F don'^t 
*£hink we can say^ a score of -SQ. does the studefft* justide 
At the same time, it woyld be \n extreme injustice to> 
student if he scored 130, but^ad the ftst disfegardM ^; 
on the besis of its b^ing biased. 
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If we accept that tests can be biased, and if 
accept that people c4fi be biased, then another approach 
is to understand' the nature of the bias. But, in fair-^ 
ness to the student, 'the information available shoul-d be 
used' in such a way that' the student's learning potential 
can be maximized. The construction of newer tests, which 
either remove the bias or can be used to enhance .ji stu- 
dent's learning, is helpful. Just as some students dbn't 
perform well on some tesXs, other, students don't interact 
well with some teachers. To label either group on the 
basis of a limited hit of ^information is perhaps the 
wopst b^^s of alii ^ t 

hitat should be-the fate of the standardized testing 
movement? The "fate.'' §uggested here is that those who 
are meet involved in tha testyig process should, ^in some 
meaningful way, begin Warning together ^oii 'the uses and/ 
or abuses of-testing (or for that matter, the uses and 
abuses^f not testing). Clearly, no natiohaKmandate, 
whethePit favors some type of moratorium or an extension 
of the testing movement, wou-ld^allow sufficient individu- 
al paf tic ipatioh in the reasoning process. 

When students •ask, '^ifhy do Miie have to take this 
test?", it is hoped that the teacher and/^r principal 'can 
offer sound educational reasons as it relates to the in- , 
dividual' s learning experience, ^f those reasons are not 
satisfactory and the studei^t is well apprisecl of the cbn« 
tingencies of npt taking tK^ test, the^ the .student could 
be -excused from the testing process. Perhaps as the use 
of tests becomes an outgrowth of a participatory democra- 
cy, wherein the various persons involved reason togeth'er, 
the prbmis^d of the testing industry , to help enhance stu- 
dents' learning wQuld start to be fulfilled. Under 5uch 
a system, wherein each person- understai%ds and willingly 
accepts the rati^nale^ for the test or tests utilized,^ 
testing would b^een #(2lnd function) as an integral part 
"of the" learning process, rather than as a 'bulwark to ^a 
burgeoning bureaucracy. 
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